You Are Where You Edit: Locating Wikipedia Contributors through Edit Histories
نویسندگان
چکیده
Whether knowingly or otherwise, Wikipedia contributors reveal their interests and expertise through their contribution patterns. An analysis of Wikipedia edit histories shows that it is often possible to associate contributors with relatively small geographic regions, usually corresponding to where they were born or where they presently live. For many contributors, the geographic coordinates of pages they have edited are tightly clustered. Results suggest that a wealth of information about contributors can be gleaned from edit histories. This illustrates the efficacy of data mining on large, publicly-available datasets and raises potential privacy
منابع مشابه
Setting the facts straight on Wikipedia.
If you enter almost any name or topic into your search engine, the first item listed will probably be fromWikipedia. It has become, by default, the world’s chief source of information. The quality of information you find there will of course be highly variable. As critics point out, anyone with access to a computer can edit a Wikipedia page. You can insert incorrect information, or add falsehoo...
متن کاملWho Did What: Editor Role Identification in Wikipedia
Understanding the social roles played by contributors to online communities can facilitate the process of task routing. In this work, we develop new techniques to find roles in Wikipedia based on editors’ low-level edit types and investigate how work contributed by people from different roles affect the article quality. To do this, we first built machinelearning models to automatically identify...
متن کاملEdit Categories and Editor Role Identification in Wikipedia
In this work, we introduced a corpus for categorizing edit types in Wikipedia. This fine-grained taxonomy of edit types enables us to differentiate editing actions and find editor roles in Wikipedia based on their low-level edit types. To do this, we first created an annotated corpus based on 1,996 edits obtained from 953 article revisions and built machine-learning models to automatically iden...
متن کاملLearning to Predict the Quality of Contributions to Wikipedia
Although some have argued that Wikipedia’s open edit policy is one of the primary reasons for its success, it also raises concerns about quality — vandalism, bias, and errors can be problems. Despite these challenges, Wikipedia articles are often (perhaps surprisingly) of high quality, which many attribute to both the dedicatedWikipedia community and “good Samaritan” users. As Wikipedia continu...
متن کاملAssessing the Quality of Wikipedia Pages Using Edit Longevity and Contributor Centrality
In this paper we address the challenge of assessing the quality of Wikipedia pages using scores derived from edit contribution and contributor authoritativeness measures. The hypothesis is that pages with significant contributions from authoritative contributors are likely to be high-quality pages. Contributions are quantified using edit longevity measures and contributor authoritativeness is s...
متن کامل